Orientation-boosted Voxel Nets for 3D Object Recognition
Abstract
Recent work has shown good results in 3D object recognition using 3D convolutional networks. In this paper, we show that the object orientation plays an important role in 3D recognition. More specifically, we argue that objects induce different features in the network under rotation. Thus, we approach the category-level classification task as a multi-task problem, in which the network is trained to predict the pose of the object in addition to the class label as a parallel task. We show that this yields significant improvements in the classification results. We test our suggested architecture on several datasets representing various 3D data sources: LiDAR data, CAD models, and RGB-D images. We report state-of-the-art results on classification as well as significant improvements in precision and speed over the baseline on 3D detection.
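The multi-task idea in the abstract, one network trained on class label and pose together, can be sketched as a joint loss over two output heads. This is only an illustrative sketch, not the authors' architecture: the abstract does not say how pose is encoded or how the two losses are combined, so the discretized orientation bins, the linear heads, and the names `multitask_loss` and `orient_weight` are all assumptions made here.

```python
import numpy as np

def softmax_cross_entropy(logits, labels):
    """Mean cross-entropy of integer labels under softmax(logits)."""
    shifted = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(labels)), labels].mean()

def multitask_loss(features, w_class, w_orient,
                   class_labels, orient_labels, orient_weight=1.0):
    """Joint loss over a class head and an orientation head.

    Assumptions (not from the source): features come from a shared
    network trunk, both heads are linear, orientation is discretized
    into bins, and the two losses are combined as a weighted sum.
    """
    class_logits = features @ w_class      # (batch, num_classes)
    orient_logits = features @ w_orient    # (batch, num_orientation_bins)
    class_loss = softmax_cross_entropy(class_logits, class_labels)
    orient_loss = softmax_cross_entropy(orient_logits, orient_labels)
    total = class_loss + orient_weight * orient_loss
    return total, class_loss, orient_loss

# Usage with random stand-in data (hypothetical sizes).
rng = np.random.default_rng(0)
features = rng.normal(size=(4, 16))          # shared features for 4 objects
w_class = rng.normal(size=(16, 10)) * 0.1    # 10 object classes
w_orient = rng.normal(size=(16, 12)) * 0.1   # 12 orientation bins
class_labels = np.array([0, 3, 7, 9])
orient_labels = np.array([1, 5, 5, 11])
total, class_loss, orient_loss = multitask_loss(
    features, w_class, w_orient, class_labels, orient_labels, orient_weight=0.5)
print(total, class_loss, orient_loss)
```

Because both heads read the same features, gradients from the orientation term also shape the shared representation, which is the mechanism the abstract credits for the classification gains. Setting `orient_weight=0` reduces the sketch to plain single-task classification.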
Similar resources
Object Recognition in 3D Point Cloud of Urban Street Scene
In this paper we present a novel street scene semantic recognition framework, which takes advantage of 3D point clouds captured by a high-definition LiDAR laser scanner. An important problem in object recognition is the need for sufficient labeled training data to learn robust classifiers. In this paper we show how to significantly reduce the need for manually labeled training data by reduction...
STV-based video feature processing for action recognition
Video recordings can provide rich and intuitive information on dynamic events occurring over a period of time, such as human actions, crowd behaviours, and other subject pattern changes, in comparison to still image-based processes. However, although substantial progress has been made in the last decade on 2D image processing and its applications such as face matching and object recognition, vi...
Harmonic Shape Histograms for 3D Shape Classification and Retrieval
In this paper, we present a novel approach towards 3D shape recognition and retrieval using histograms of rotation invariant local features. Features are extracted for every point of voxelized 3D shape objects by use of functions on spheres which are invariant towards rotation of the object. The fast computation of the local features is performed via convolution methods in frequency space. Hist...
Segmentation Assisted Object Distinction for Direct Volume Rendering
Ray Casting is a direct volume rendering technique for visualizing 3D arrays of sampled data. It has vital applications in medical and biological imaging. Nevertheless, it is inherently open to cluttered classification results. It suffers from overlapping transfer function values and lacks a sufficiently powerful voxel parsing mechanism for object distinction. In this work, we are proposing an ...
Supplementary Material for “Data-Driven 3D Voxel Patterns for Object Category Recognition”
In building the 3D voxel exemplars, we voxelize a 3D CAD model into a distribution of 3D voxels. Since 3D CAD models from the web repositories, such as the Trimble 3D Warehouse [1], are usually irregular and not water-tight, we employ the volumetric depth map fusion technique, which is widely used in dense 3D reconstruction in the literature [7], to build the voxel representation of a 3D CAD mod...
Journal title:
- CoRR
Volume abs/1604.03351, issue -
Pages -
Publication date 2016